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Introduction 


Germline  mutations  in  BRCA1  lead  to  an  increased  risk  of  breast  and  ovarian 
cancer,  with  loss  of  the  second,  wild  type  allele  critical  to  tumorigenesis  (1).  More  than 
400  distinct  BRCA1  germline  mutations  have  been  identified  to  date  (2).  Mutation 
detection  in  BRCA1  is  complicated  by  the  ubiquitous  distribution  of  these  mutations 
throughout  the  BRCA1  coding  region  (2).  As  a  result,  most  investigators  have  chosen  to 
use  a  combination  of  techniques  that  rely  on  amplification  of  small,  ~  150-700  base  pairs 
(bp)  fragments  from  the  BRCA1  genomic  region.  This  approach  has  worked  relatively 
well  for  germline  mutation  screening  of  BRCA1,  as  most  mutations  are  small  insertions, 
deletions,  or  single  bp  substitutions  that  lead  to  premature  protein  truncation  (2). 
However,  mutation  detection  based  on  amplification  of  small  fragments  will  not  identify 
large-scale  alterations  of  the  BRCA1  genomic  region  such  as  deletions,  duplications,  or 
inversions  of  hundreds  of  bp. 

Of  the  breast  and  ovarian  tumors  screened  by  several  independent  groups  only  a 
few  tumors  with  somatic  mutation  in  BRCA1  have  been  reported  (1,3-5).  All  fourteen  of 
the  tumors  harboring  these  somatic  mutations  exhibited  LOH  for  the  BRCA1  region  of 
chromosome  17. 

Thus,  BRCA1  somatic  mutations  detectable  by  conventional  PCR-based 
techniques  are  extremely  rare  in  breast  and  ovarian  tumors.  These  conventional 
screening  procedures,  however,  neglect  a  class  of  mutations:  large  genomic 
rearrangements.  Several  large  germline  deletions  and  one  large  germline  duplication  in 
BRCA1  have  been  reported  (6-1 1).  All  of  the  large  germline  rearrangements  were 
undetectable  by  conventional  genomic  DNA  (gDNA)  screening  procedures.  Large 
germline  deletions  appear  to  be  a  relatively  common  feature  of  inherited  breast  cancer  in 
the  Dutch  population,  due  in  large  part  to  founder  mutations  (8). 

Several  features  of  the  BRCA1  locus  provide  clues  as  to  the  nature  of  large 
genomic  rearrangements  that  might  occur.  BRCA1  has  one  of  the  highest  densities  of  Alu 
elements  of  genes  deposited  in  GenBank  (41.5%  of  81  kb)  (12).  Large  genomic 
rearrangements  in  human  genes  frequently  are  associated  with  Alu  repetitive  elements 
(13,14). 
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Another  predisposing  feature  of  the  BRCA1  locus  is  a  tandem  duplication 
involving  the  5'  region  of  BRCA1  (15,16).  The  duplication  includes  the  BRCA1  promoter 
region  and  a  neighboring  gene,  NBR1.  The  result  of  the  duplication  was  the  creation  of  a 
new  gene,  NBR2,  with  which  human  BRCA1  shares  a  bi-directional  promoter  and  the 
creation  of  a  pseudo-copy  of  BRCA1  situated  in  a  similar  head  to  head  orientation  with 
the  NBR1  gene  approximately  30  kb  upstream  of  the  functional  BRCA1  locus  (17).  The 
presence  of  a  pseudo-copy  of  BRCA1  with  >90%  identity  located  so  near  on  chromosome 
17  further  predisposes  the  functional  BRCA1  locus  to  inactivation  by  misaligned 
homologous  recombination  or  gene  conversion. 

Project  goals 

Several  possible  interpretations  could  explain  the  paradox  presented  by  the  lack  of 
somatic  mutations  in  BRCA1.  One  formal  explanation  is  that  BRCA1  is  not  involved  in 
sporadic  breast  and  ovarian  cancer.  It  is  possible  that  the  high  rate  of  LOH  for  the 
BRCA1  region  in  sporadic  tumors  actually  targets  a  nearby  gene,  and  not  BRCA1.  A 
second  explanation  is  an  alternative  mechanism  for  loss  of  the  second  BRCA1  allele  in 
sporadic  tumors.  A  third  possibility  is  that  the  lack  of  BRCA1  mutations  in  sporadic 
tumors  actually  reflects  limitations  in  current  mutation  screening  procedures.  Given  the 
genomic  organization  of  the  BRCA1  locus  and  the  methods  commonly  used  to  screen  for 
BRCA1  mutations,  it  is  conceivable  that  BRCA1  somatic  mutations  in  breast  and  ovarian 
cancer  exist  but  have  remained  undetected  by  conventional  mutation  detection  methods. 

The  research  detailed  in  the  following  sections  further  elucidates  the  role  of 
BRCA1  in  breast  cancer  by  analyzing  the  genetic  alterations  that  lead  to  inactivation  of 
BRCA1.  First,  the  characterization  of  large  germline  rearrangements  identified  in  BRCA1 
and  their  contribution  to  breast  cancer  in  a  population-based  series  of  breast  cancer 
patients  will  be  described.  Second,  the  nature  of  somatic  alterations  identified  from 
sporadic  breast  tumors  and  their  relationship  to  BRCA1  protein  expression  will  be 
detailed.  Finally,  a  rare  variant  allele  of  BRCA1  and  the  potential  significance  of 
noncoding  variation  in  the  BRCA1  genomic  region  will  be  described. 
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Body 


Characterization  of  a  large  germline  deletion  in  the  BRCA1  region  leading  to  breast  and 
ovarian  cancer  predisposition 

One  large  germline  rearrangement  was  identified  from  Family  58  (also  known  as 
Family  5).  The  rearrangement  consisted  of  a  1039  bp  deletion  with  a  10  bp  inverted 
duplication  inserted  at  the  breakpoint  junction  (g.  12977  ins  10  del 1039).  The  research 
characterizing  this  deletion  was  described  in  my  1998  Annual  Report  and  has  been 
accepted  for  publication  in  Genes,  Chromosomes,  &  Cancer. 

No  other  large  germline  rearrangements  were  detected  in  the  families  or  cell  lines 
using  either  Long  PCR  or  Southern  analysis.  Several  restriction  fragment  length 
polymorphisms  (RFLP’s)  were  observed.  The  RFLP’s  identified  and  their  observed 
frequencies  are  listed  in  Table  1. 

Population-based  investigation  of  two  large  germline  rearrangements  predisposing  to 
breast  and  ovarian  cancer 

Next  we  investigated  the  contribution  of  two  different  large  germline 
rearrangements  to  breast  cancer  in  the  American  population.  We  analyzed  one  large 
deletion  (the  g.  12977  ins  10  del  1039  mutation)  and  one  large  duplication  (the 
g.43368ins608 1  mutation)  in  a  population-based  series  of  American  breast  cancer 
patients  unselected  for  family  history  (10,18,19).  The  g.12977  inslO  dell039  mutation 
was  investigated  in  242  white  and  164  African-American  breast  cancer  patients.  No 
mutation  carriers  were  detected  in  406  cases.  The  g.43368ins6081  mutation  was 
investigated  in  220  white  and  153  African-American  breast  cancer  patients.  One 
mutation  carrier  (CBCS  950749)  was  detected  in  373  cases. 

The  g.43368ins6081  mutation  was  previously  identified  in  four  American  families 
of  mixed  European  descent  with  multiple  cases  of  breast  and/or  ovarian  cancer  and  one 
Portuguese  family  with  three  cases  of  breast  cancer  (10,  Heather  Mefford,  pers.  comm.). 
A  founder  effect  for  the  g.43368ins6081  mutation  was  postulated  based  on  the 
observation  that  four  of  the  families  shared  the  same  haplotype  at  nine  polymorphic 
markers  within  or  flanking  the  BRCA1  locus  (10). 
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LOH  of  the  BRCA1  region  in  sporadic  breast  carcinoma 

The  BRCA1  genomic  region  investigated  is  shown  in  Figure  1.  Of  the  paired 
breast  normal  and  tumor  samples  informative  for  LOH,  50%  (39/78)  of  invasive  ductal 
tumors  lost  an  allele  at  one  or  more  markers  within  the  BRCA1  region.  None  of  the  12 
lobular  carcinomas  showed  any  allelic  loss.  Of  the  56  tumors  informative  for  the  three 
BRCA1  intragenic  markers  (D17S1323,  D17S1322,  D17S855),  36  tumors  (64.3%)  had 
lost  at  least  one  intragenic  marker.  Allelic  loss  at  the  three  BRCA1  intragenic  markers  as 
well  as  at  the  three  flanking  markers  (D17S1320,  D17S1326,  D17S1325)  was  associated 
significantly  with  tumor  grade.  LOH  was  detected  in  44%  (4/9)  of  grade  I  tumors,  28% 

(1 1/40)  of  grade  II  tumors,  and  63%  (26/41)  grade  HI  tumors  (p<0.005).  This  association 
is  consistent  with  previous  reports  of  association  between  somatic  LOH  at  the  BRCA1 
locus  and  tumor  grade  (20-23).  Estrogen  receptor  status  was  independently  associated 
with  LOH  (p=0.046)  as  reported  previously  (22,23).  There  was  no  independent 
association  between  LOH  and  tumor  stage  or  age  at  diagnosis. 

Microsatellite  instability  in  an  infiltrating  ductal  carcinoma 

Case  12  was  unique  in  that  additional  alleles  appeared  at  four  markers  coincident 
with  LOH  at  the  remaining  two  markers.  In  order  to  test  whether  the  additional  alleles 
were  due  to  microsatellite  instability  (MSI),  we  analyzed  markers  indicative  of  MSI  for 
Case  12.  These  markers  were  shown  previously  to  be  diagnostic  of  a  mismatch  repair 
defect  in  tumor  cells  (24).  Five  of  ten  diagnostic  markers  were  analyzed  for  Case  12: 
BAT26,  BAT40,  D2S123,  D5S346,  and  Mfdl5.  Case  12  showed  instability  at  four  of  the 
five  markers  tested.  Only  the  BAT26  marker  was  stable  in  tumor  gDNA.  Thus,  Case  12 
showed  80%  unstable  loci  among  the  first  five  markers. 

Long  PCR  and  Southern  analysis  of  sporadic  breast  carcinomas 

Of  the  92  samples,  84  were  amplified  successfully  for  all  15  Long  PCR  primer 
pairs.  However,  no  large  somatic  rearrangements  were  detected.  Several  RFLP’s  were 
observed  in  both  normal  and  tumor  gDNA  from  the  cases.  These  are  listed  in  Table  2. 
One  large  germline  deletion  in  intron  15  of  BRCA1  was  detected  in  Case  72. 
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In  order  to  detect  very  large  deletions,  duplications,  and  rearrangements  that 
would  be  missed  by  PCR,  we  also  performed  Southern  analysis.  As  tumor  DNA  was 
limiting  and  because  no  large  rearrangements  were  identified  using  Long  PCR,  Southern 
analysis  was  performed  with  only  the  40  tumors  that  demonstrated  LOH  within  the 
BRCA1  region.  The  eight  samples  that  were  not  screened  fully  by  Long  PCR  were 
analyzed  by  Southern  as  well.  One  ~250  bp  germline  insertion  in  intron  7  of  BRCA1  was 
observed  in  Case  72.  No  large  somatic  rearrangements  were  detected. 

BRCA1  protein  expression  and  loss  of  heterozygosity 

In  collaboration  with  Rachel  Gonzalez-Hemandez  (Pathology  Department, 
University  of  Washington),  BRCA1  protein  expression  in  breast  normal  and  tumor  tissue 
sections  was  analyzed  by  immunohistochemistry  (IHC)  using  monoclonal  antibody 
MSI  10  as  previously  described  (25).  The  normal  and  tumor  breast  tissues  were  scored 
on  a  scale  of  0  to  4.  Normal  breast  epithelium  exhibited  staining  consistent  with  a  score 
of  3.  Reduced  BRCA1  protein  (scores  of  0  to  2)  was  observed  in  72%  (49/68)  of  the 
breast  tumors  and  in  the  majority  of  tumors  (82%,  28/34)  with  high  histologic  grade  as 
compared  to  normal  breast  epithelial  tissue  (p<0.001). 

Reduced  BRCA1  protein  expression  was  associated  with  genomic  loss  at 
D17S1323,  the  intragenic  marker  closest  to  the  BRCA1  transcription  start  site.  Reduced 
BRCA1  protein  staining  (scores  of  0  to  2  on  a  scale  of  0  to  4)  occurred  in  92%  (1 1/12)  of 
tumors  with  genomic  loss  at  D17S1323  but  in  55%  (12/22)  without  genomic  loss  at  this 
marker  (p<0.02).  Genomic  loss  at  marker  D17S1325  was  associated  marginally  with 
reduced  BRCA1  protein  expression  (p=0.05),  although  the  significance  of  this 
association  is  unclear.  Marker  D17S1325  is  located  ~300  kb  upstream  of  BRCA1. 
Although  marker  D17S1326  lies  between  D17S1323  and  D17S1325,  LOH  at  D17S1326 
is  not  associated  with  reduced  BRCA1  protein  staining. 

55%  (12/22)  of  tumors  in  the  series  had  reduced  BRCA1  protein  staining  (scores 
of  0  to  2)  despite  no  detectable  genomic  loss  of  the  BRCA1  transcription  start  as  indicated 
by  marker  D17S1323.  Reduced  BRCA1  protein  was  observed  in  a  high  proportion  (8/14, 
57%)  of  the  tumors  that  were  informative  at  all  three  intragenic  markers  and  fully 
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retained  both  alleles  of  BRCA1.  In  this  subset  of  tumors,  other  mechanisms  are  likely 
responsible  for  BRCA1  inactivation. 

Characterization  of  a  large  germline  deletion  of  unknown  significance  from  Case  72 

In  most  of  the  217  alleles  investigated,  the  general  organization  of  the  BRCA1 
region  was  consistent  with  the  published  sequence.  More  than  48  restriction  sites  over  a 
region  of  more  than  85  kb  were  analyzed  in  high-risk  breast  and/or  ovarian  cancer  family 
members  and  sporadic  breast  cancer  patients  unselected  for  family  history.  In  the  course 
of  analyzing  the  family  and  tumor  DNA  for  large  rearrangement  of  BRCA1,  however, 
two  large  germline  rearrangements  of  unknown  significance  were  identified  from  a  single 
allele  in  normal  and  tumor  gDNA  from  Case  72.  In  addition,  several  small  alterations 
were  identified  in  the  variant  allele  from  Case  72  containing  the  two  large  germline 
rearrangements. 

A  large  germline  deletion  of  BRCA1  was  identified  from  Case  72.  Using  primer 
pair  10,  designed  to  amplify  a  10.6  kb  product  encompassing  exons  15-18,  variant 
restriction  fragments  were  detected  using  four  different  restriction  enzymes.  The  variant 
restriction  fragments  were  present  in  equal  intensity  in  gDNA  from  normal  breast 
epithelial  tissue  of  Case  72,  but  the  variant  bands  were  preferentially  retained  in  gDNA 
prepared  from  an  infiltrating  ductal  carcinoma  of  Case  72. 

Based  on  the  restriction  digests,  a  minimal  region  of  ~3800  bp  (GenBank  L78833 
bp  54885-58706)  containing  a  deletion  of  -1000  bp  was  identified  near  BRCA1  exon  16. 
Wild  type  and  variant  PCR  products  encompassing  the  deletion  were  cloned  and 
sequenced.  All  clones  of  the  variant  PCR  products  contained  the  same  SNP’s  that  were 
not  observed  in  clones  from  the  wild  type  PCR  products.  However,  all  clones  from  the 
variant  PCR  products  contained  an  intact  exon  16,  indicating  that  the  breakpoint  occurred 
in  intron  15.  Sequence  of  all  variant  clones  revealed  the  simple  deletion  shown  in  Figure 
2.  Based  on  BRCA1  genomic  sequence  L78833,  the  deletion  is  a  977  bp  deletion  of  bp 
56025-57001  from  BRCA1  intron  15.  The  notation  for  the  deletion  (26)  is  g.56024 
del977  based  on  GenBank  L78833.  The  5'  breakpoint  for  g.56024  del977  occurs  in  the  3' 
tail  of  an  Alu-Sp  element  and  the  3'  breakpoint  occurs  9  bp  into  the  5'  head  of  an 
oppositely  oriented  Alu-Y  element.  Southern  analysis  using  a  probe  to  BRCA1  exons  1 1- 
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24  confirmed  the  deletion  and  indicated  that  the  deleted  allele  was  retained  in  Case  72 
tumor  DNA. 

Characterization  of  a  large  expansion  of  a  compound  microsatellite  sequence  in 
BRCA1  intron  7 

Southern  analysis  of  Hind  Ill-digested  Case  72  tumor  gDNA  revealed  an  extra 
band  of  ~4.3  kb.  Further  Southern  analysis  of  Case  72  normal  and  tumor  gDNA  using 
multiple  restriction  enzymes  indicated  that  a  minimal  region  of  -3500  bp  (GenBank 
L78833  bp  26,91 1-30394)  including  BRCA1  exon  8  contained  an  insertion  of  -250  bp 
(Figure  3).  Overlapping  PCR  primer  pairs  were  designed  to  span  the  region  and  tumor 
DNA  from  Case  72  was  amplified.  The  exon  8  containing  PCR  product  was  of  expected 
size,  but  a  primer  pair  (ivs7b)  amplifying  a  region  just  upstream  of  exon  8  (GenBank 
L78833  bp  27,656-28,733)  yielded  a  product  -250  bp  larger  than  expected  from  Case  72 
tumor  gDNA. 

The  variant  ivs  7b  product  for  Case  72  was  gel-extracted  and  sequenced  using 
internal  primers.  Multiple  independent  sequencing  reactions  revealed  an  expansion  of  a 
compound  microsatellite  located  at  GenBank  L78833  bp  -28,087  to  28,174  (Figure  4). 

In  the  published  genomic  sequence  the  imperfect  compound  microsatellite  GTN  (AT)^ 
is  87  bp  in  size.  In  the  variant  allele  from  Case  72,  the  compound  microsatellite  is  333  bp 
in  size. 

Other  sequence  alterations  observed  in  the  variant  allele  from  Case  72 

Several  other  alterations  were  observed  in  the  variant  allele  retained  in  the  tumor 
of  Case  72  (Figure  5).  Two  single  bp  substitutions  were  observed  in  exon  16.  One  is  a  C 
to  T  transition  at  nucleotide  (nt)  4801  of  HSU  146 80.  The  substitution  does  not  affect 
translation  of  the  BRCA1  message.  This  SNP  has  been  observed  before,  although  at  a 
very  low  frequency.  It  was  identified  in  one  African-American  patient  in  the  Carolina 
Breast  Cancer  Study  and  is  present  in  about  1%  of  the  population  (2).  The  second 
substitution  is  a  A  to  G  transition  at  nt  4956.  The  substitution  results  in  a  missense 
polymorphism  at  codon  16 13  (S1613G).  This  missense  polymorphism  has  been  observed 
before  as  well  and  is  present  in  32%  of  the  population  (27).  Finally,  a  24  bp  insertion  at 


GenBank  L78833  bp  57,21 1  was  identified  in  the  variant  allele  (Figure  6).  The  origin  of 
the  24  bp  insertion  is  unknown,  but  it  contains  a  region  of  13-20  bp  identity  with 
sequence  from  human  chromosome  17  clone  hRPK.268_F_2,  human  chromosome  6  PAC 
271G9,  and  an  unassigned  human  BAC  RG191D16. 

In  summary,  one  large  rearrangement  identified  from  Case  72  involves  germline 
deletion  of  977  bp  from  BRCA1  intron  15.  The  second  large  rearrangement  involves 
germline  expansion  of  a  compound  microsatellite  sequence  in  BRCA1  intron  7.  While 
neither  of  the  rearrangements  alters  coding  regions  of  BRCA1,  they  are  of  interest 
because  the  allele  that  harbors  both  of  these  rearrangements  is  retained  in  an  infiltrating 
ductal  carcinoma  from  Case  72  while  the  wild-type  allele  of  BRCA1  is  lost. 

There  is  a  50%  probability  that  the  Case  72  variant  allele  of  BRCA1  is  retained  in 
the  tumor  randomly.  In  order  to  test  whether  the  large  rearrangements  might  affect 
processing  of  the  BRCA1  transcript  from  the  rare  allele,  we  analyzed  splicing  of  exons 
flanking  the  intronic  rearrangements.  The  large  germline  rearrangements  do  not  affect 
splicing  of  BRCA1  as  determined  by  RT-PCR  using  a  variety  of  primer  pairs  located  in 
exons  flanking  the  rearrangements.  Additionally,  we  tested  the  variant  allele  for  loss  of 
transcript  using  single  nucleotide  a  polymorphisms  (SNP)  that  was  identified  in  exon  16. 
The  variant  allele  was  stable  and  was  preferentially  retained  in  RNA  from  the  invasive 
ductal  carcinoma  of  Case  72. 

The  difficulty  in  classifying  the  status  of  these  large  germline  rearrangements 
highlights  an  area  of  increasing  interest:  noncoding  variation  in  the  human  genome.  The 
degree  of  variability  within  the  BRCA1  region  has  important  implications  for  assessing 
the  age  of  founder  mutations.  Additionally,  an  elevated  mutation  rate  within  the  BRCA1 
region  could  implicate  BRCA1  as  a  target  of  genomic  instability.  It  will  be  important  to 
assess  the  degree  and  nature  of  noncoding  variation  within  the  BRCA1  region  in  order  to 
determine  the  effect  of  rare  alleles  such  as  the  variant  allele  retained  in  the  infiltrating 
ductal  carcinoma  of  Case  72.  The  determination  of  the  amount  and  the  degree  to  which 
individual  genomes  vary  will  be  critical  to  determining  both  the  relevance  to  human 
disease  for  variant  alleles  such  as  the  rare  allele  described  from  Case  72  and  the  use  of 
genetic  variation  to  infer  the  evolutionary  history  of  human  populations. 
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Table  1  Restriction  Fragment  Length  Polymorphisms  (RFLP’s)  Identified  in 
BRCA1  from  Breast  Cancer  Cases 
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Location  of  the  restriction  cut  site  affected  unless  the  specific  nucleotide  alteration  was  characterized. 
The  number  of  chromosomes  in  which  the  RFLP  was  identified. 


Table  2  Restriction  Fragment  Length  Polymorphisms  (RFLP’s)  Identified  in 
BRCA1  from  Breast  Cancer  Cases 


Of) 

<D 

<D 

s 

s 

S 

A 

o 

o 

o 

pQ 

> 

GO 

o 

o 

00 

O 

c-> 

s 

£ 

s 

c 

o 

o 

o 

0> 

J3 

w 

43 

S3 

o 

o 

o 

o 

a> 

oo 

00 

oo 

Sn 

y—t 

v-H 

fa 

i n 

2h 

08 

oo 

QO 

r- 

& 

co 

PQ 

fl 

o 

o 


o 

On 

00 

<N 

r- 

CO 

vq 

r- 

in" 

o' 

CO 

s 

#o 

co 

CJ 

O 

fa 


CO 

Cfl 

> 


CO  00 

Crt  GO 

>  #> 


d  X) 


(14) 


Location  of  the  restriction  cut  site  affected  unless  the  specific  nucleotide  alteration  was  characterized. 
The  number  of  chromosomes  in  which  the  RFLP  was  identified. 


Figure  1 
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Figure  1  92  breast  carcinomas  were  analyzed  for  LOH  using  the  six  microsatellite  markers  indicated  in  blue.  84  of 

92  breast  carcinomas  were  analyzed  by  Long  PCR  and  46  of  92  breast  carcinomas  were  analyzed  by  Southern  for  large 
rearrangement  of  BRCA 1.  68  of  92  breast  carcinomas  were  analyzed  by  immunohistochemistry  using  the  MSI  10  MAb. 
The  BRCA1  exons  that  encode  the  epitope  recognized  by  MSI  10  are  indicated  in  orange. 


Figure  2 


Wild-type  BRCA1  Sequence: 


56025 


977  bp  Germline  Deletion  in  Case  72: 


56024  57002 


Figure  2  On  the  left  is  sequence  from  the  unretained  Case  72  BRCA1  allele. 
The  sequence  from  the  unretained  allele  matches  the  GenBank  L78833  sequence. 
On  the  right  is  sequence  from  the  977  bp  deletion  breakpoint  junction  of  Case 
72. 
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Figure  3 
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Figure  3  Southern  analysis  of  a  large  germline  rearrangement  involving  the 
BRCA1  exon  8  genomic  region.  10  ag  of  restriction  enzyme-digested  genomic 
DNA  was  electrophoresed  and  Southern  blotted  as  described  in  Materials  and 
Methods.  The  blot  was  hybridized  to  a  radioactively  labelled  probe  made  from 
exons  3  to  1 1  of  the  BRCA1  cDNA.  **  indicates  the  rearrangement  junction 
fragment.  Control  individuals  are  from  Family  48. 
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Figure  4 


Wild-type  BRCAI: 


Case  72  Insertion: 

3’  5’ 


Figure  4  BRCAI  intron  7  sequence  from  a  control  individual  (4310)  is  shown 
in  the  top  panel.  The  sequence  from  the  control  allele  matches  the  GenBank 
L78833  sequence.  Sequence  from  the  BRCAI  intron  7  insertion  of  "250  bp 
identified  in  the  retained  allele  from  Case  72  is  shown  in  the  bottom  panel.  The 
insertion  consists  of  an  expansion  of  the  compound  microsatellite  repeat  unit 
(AT)n  AAC.  Both  sequences  begin  at  approximately  bp  28,084  of  the  GenBank 
L78833  sequence.  The  3’  and  5’  symbols  represent  sequence  orientation  with 
respect  to  BRCAI  transcription.  BRCAI  exon  8  is  located  at  bp  28,853  to  28,957of 
GenBank  L78833.  The  compound  microsatellite  repeat  unit  is  indicated  by  a 
red  bar.  The  3’  tail  of  an  Alu  Y  element  is  indicated  in  the  control  sequence.  In 
the  retained  allele  from  Case  72,  the  Alu  Y  element  lies  "250  bp  downstream  of 
the  sequence  shown. 
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Figure  5 


Wild-type  Allele: 


C4801T  in  Case  72: 


Wild-type  Allele:  A4956G  in  Case  72: 

3’  5’  3’  5’ 


Figure  5  Single  nucleotide  polymorphisms  identified  in  the  variant  allele  from 
Case  72.  The  panels  on  the  left  represent  wild-type  sequence  in  the  wild-type 
allele  from  Case  72.  The  panels  on  the  right  represent  sequence  of  single  nucleotide 
polymorphisms  in  BRCA1  exon  16  of  the  variant  allele  from  Case  72.  In  the  top 
panels,  sequence  flanking  a  C  to  T  transition  at  nt  4801  of  HSU  14680  is  shown. 
In  the  bottom  panels,  sequence  flanking  a  A  to  G  transition  at  nt  4956  of  HSU 
14680  is  shown.  The  red  arrows  indicate  the  location  of  the  transition  substitutions. 
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Figure  6 


Wild-type:  24  bp  Insertion  in  Case  72: 


Figure  6  24  bp  insertion  in  BRCA1  intron  15.  Sequence  from  the  unretained 

Case  72  BRCA1  allele  is  shown  in  the  left  panel.  The  sequence  from  the  unretained 
allele  matches  the  GenBank  L78833  sequence.  Sequence  from  the  insertion 
junction  fragment  of  the  24  bp  identified  in  the  variant  allele  from  Case  72  is 
shown  in  the  right  panel.  The  red  bar  indicates  the  inserted  nucleotides  and  the 
red  arrow  indicates  the  site  of  insertion. 
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Key  Accomplishments 


1)  We  designed  a  Long  PCR  screen  for  detecting  large  genomic  rearrangement  of 
BRCA1.  The  screen  has  been  used  successfully  to  identify  two  novel  large  germline 
rearrangements  in  our  lab.  A  third  large  germline  rearrangement  of  BRCA1  was 
identified  using  Southern  analysis. 

2)  We  identified  a  complex  large  germline  deletion  within  BRCA1  that  results  in  exon  3 
being  skipped  in  a  family  with  inherited  breast  and  ovarian  cancer.  The  mutation 
differs  from  previous  reports  in  that  neither  of  the  deletion  breakpoints  occurs  in  an 
Alu  element  and  there  is  no  more  than  4  bp  identity  between  sequence  near  the  5’ 
breakpoint  and  sequence  near  the  3’  breakpoint  Thus,  this  mutation  is  likely  the 
result  of  a  nonhomologous  mechanism  and  is  unique  among  large  genomic 
rearrangements  of  BRCA1. 

3)  We  analyzed  92  breast  carcinomas  unselected  for  family  history  using  the  Long  PCR 
screen  in  combination  with  Southern  analysis.  Large  somatic  rearrangements  of 
BRCA1,  as  with  small  mutations  in  BRCA1,  are  rare  in  sporadic  breast  cancer. 

4)  Reduced  BRCA1  protein  expression  was  associated  with  genomic  loss  at  D17S1323, 
the  intragenic  marker  closest  to  the  BRCA1  transcription  start  site.  The  association 
of  reduced  BRCA1  protein  with  BRCA1  LOH  in  tumors  is  specific  to  LOH  at  marker 
D17S1323,  the  intragenic  marker  closest  to  the  5'  end  of  BRCA1. 

5)  We  characterized  a  rare  variant  allele  of  BRCA1  in  which  two  large  germline 
rearrangements,  as  well  as  numerous  small  genomic  alterations,  reside  in  cis. 
Although  the  variant  allele  is  retained  in  an  invasive  ductal  carcinoma,  no  effect  on 
the  stability  or  splicing  of  BRCA1  from  the  variant  allele  was  observed.  The 
difficulty  in  classifying  the  status  of  these  large  germline  rearrangements  highlights 
an  area  of  increasing  importance:  noncoding  variation  in  the  human  genome.  The 
determination  of  the  amount  and  the  degree  to  which  individual  genomes  vary  will 
be  critical  to  determining  both  the  relevance  to  human  disease  for  variant  alleles  such 
as  the  rare  allele  described  from  Case  72  and  the  use  of  genetic  variation  to  infer  the 
evolutionary  history  of  human  populations. 
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Key  Accomplishments  (cont.) 


6)  Many  breast  tumors  in  our  series  (55%,  12/22)  had  reduced  BRCA1  protein  staining 
despite  no  detectable  genomic  loss  of  the  BRCA1  transcription  start  as  indicated  by 
marker  D17S1323.  Reduced  BRCA1  protein  was  observed  in  a  high  proportion 
(8/14,  57%)  of  the  tumors  that  were  informative  at  all  three  intragenic  markers  and 
fully  retained  both  alleles  of  BRCA1.  In  this  subset  of  tumors,  other  mechanisms  are 
likely  responsible  for  BRCA1  inactivation. 
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Appendix  One:  Published  Abstracts 


Payne  SR  &  King  M-C.  1997.  Complex  BRCA1  germline  deletion  and 
rearrangement  of  1039  bp  leads  to  deletion  of  exon  3  and  disease  predisposition: 
Splicing  by  exon  definition  in  BRCA1.  Am  J  Hum  Genet,  61  Suppl:  A425. 

BRCA1  mutations  detectable  by  current  PCR-based  techniques  are  single  bp 
substitutions  or  small  insertions/deletions  which  comprise  most  germline  mutations  in 
high-risk  families,  but  almost  never  occur  as  somatic  mutations.  Conversely,  Mbp 
somatic  deletions  including  BRCA1,  as  reflected  by  LOH,  occur  frequently  in  both 
inherited  and  sporadic  breast  and  ovarian  cancer.  We  are  characterizing  an  intermediate 
class  of  mutations,  involving  deletions  or  rearrangements  of  100’s  to  1000’s  of  bp.  One 
such  germline  mutation  has  been  reported  in  BRCA1  (Feunteun  et  al.,  1997.  Cancer  Res, 
57:828). 

In  Family  5,  previous  analysis  of  cDNA  revealed  germline  deletion  of  exon  3 
(codon  27  stop),  cosegregating  with  10  cases  of  breast  and  ovarian  cancer  (Friedman  et 
al.,  1994.  Nat  Genet.,  8:399);  the  genomic  basis  of  the  mRNA  deletion  remained 
unknown.  Long-range  PCR,  using  an  exon  3  forward/”exon  4”  reverse  primer  pair 
designed  from  BRCA1  genomic  sequence  (gb  L78833),  reveals  a  1039  bp  genomic 
deletion  involving  part  of  exon  3  and  part  of  intron  3.  The  mutant  genomic  sequence  is: 
wt  bpl-22  of  exon  3;  insert  inverted  bp  22-13  of  exon  3;  deletion  of  1039  bp;  wt  bp 
14015  ff  intron  3.  Despite  wt  5’  and  3’  splice  sites  for  intron  2,  exon  3  is  completely 
skipped.  This  mutation  indicates  that  at  least  in  some  regions  of  BRCA1,  splicing 
follows  the  exon  definition  model.  Next,  we  will  determine  whether  intermediate  length 
deletions  occur  as  somatic  mutations  in  sporadic  breast  and  ovarian  tumors. 
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Mu  H,  Newman  B,  Rousseau  C,  Payne  S,  Gordon  K,  Gold  D,  Tse  C-K,  Millikan  RC 
&  King  M-C.  1999.  Frequency  of  breast  cancer  attributable  to  BRCA1  and  BRCA2 
in  a  population-based  series  of  Caucasian  and  African-American  women.  Am  J  Hum 
Genet,  65  Suppl:  A106. 

To  determine  the  public  health  impact  of  inherited  mutation  in  BRCA1  or  BRCA2 
on  breast  cancer,  both  genes  were  fully  screened  in  a  population-based  series  of 
sequentially  diagnosed  Caucasian  and  African-American  breast  cancer  patients  from 
North  Carolina.  New  results  for  BRCA2  and  recently  discovered  large  genomic 
alterations  complete  our  previously  published  results  on  BRCA1  in  the  same  cohort 
(Newman  et  al.,  JAMA  279:  915-921,  1998).  BRCA2  was  analyzed  for  germline 
variants  in  the  coding  sequence,  splicing  junctions  and  neighboring  intronic  regions  using 
multiplex  single-strand  conformation  analysis,  heteroduplex  analysis,  and  DNA 
sequencing  of  203  cases.  In  addition,  373  cases  were  screened  for  the  large  genomic 
mutations.  After  adjustment  for  sampling  probabilities,  the  weighted  prevalences  (%)  of 
breast  cancer  attributable  to  BRCA1  or  BRCA2  were  7.4  (95%  Cl,  1.3-12.1)  for 
Caucasian  women,  1.0  (95%  Cl,  0-3.0)  for  African-American  women,  and  6.1  (95%  Cl, 

1. 2-9.8)  overall.  Among  probands  with  at  least  three  affected  relatives,  13%  carried  a 
BRCA1  or  BRCA2  mutation.  Among  breast  cancer  probands  with  any  relative  with 
ovarian  cancer,  22%  carried  a  mutation.  Among  American  breast  cancer  patients 
generally,  those  from  these  high  risk  families  can  most  benefit  from  full  genotyping  of 
BRCA1  and  BRCA2. 
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Payne  SR,  Gonzalez-Hernandez  &  King  M-C.  1999.  BRCA1  in  sporadic  breast 
cancer:  Genomic  loss  and  protein  expression  in  primary  tumors.  Am  J  Hum  Genet, 
65  Suppl:  A 109. 

In  inherited  breast  cancer,  BRCA1  acts  as  a  classic  tumor  suppressor  in  that  both 
normal  copies  of  BRCA1  are  lost,  one  in  the  germline  and  one  somatically.  In  sporadic 
tumors,  large  somatic  deletions  including  BRCA1,  as  reflected  by  loss  of  heterozygosity 
(LOH),  occur  frequently,  while  somatic  point  mutations  are  rare.  One  argument  for  the 
role  of  BRCA1  in  sporadic  breast  cancer  has  been  the  high  rate  of  allelic  loss  observed 
for  the  BRCA1  locus.  Sporadic  invasive  breast  tumors  were  evaluated  for  protein 
expression  and  genomic  loss  at  BRCA1.  Protein  expression  was  evaluated  by 
immunohistochemistry  using  the  N-terminal  monoclonal  antibody  MSI  10.  Genomic  loss 
was  evaluated  by  LOH  using  multiple  markers  within  and  flanking  BRCA1,  by  Long 
PCR  (7-12kb  fragments)  of  the  entire  BRCA1  genomic  region  and  by  Southern  analysis. 
BRCA1  protein  expression  was  significantly  decreased  in  72%  (49/68)  of  invasive  ductal 
carcinomas  and  correlated  significantly  with  tumor  grade,  as  previously  observed 
(Wilson  et  al,  1999  Nat  Genet  21:236,  Lee  et  al,  1999  Histopath  34;106).  No  large 
genomic  rearrangements  of  somatic  origin  were  detected,  although  one  germline 
rearrangement  was  detected  by  Southern  analysis.  Although  allelic  loss  at  all  intragenic 
markers  is  not  associated  with  BRCA1  protein  reduction,  allelic  loss  at  the  marker  nearest 
the  transcription  start  site  (D17S1323)  is  associated  with  reduction  of  BRCA1  protein. 
This  observation  might  reflect  specific  targeting  of  either  the  BRCA1  promoter  or 
BRCA1  exon  1 1  for  loss.  The  function  of  BRCA1  that  is  involved  in  maintenance  of 
genetic  stability  throught  the  G2-M  checkpoint  has  been  shown  to  require  exon  1 1  (Xu  et 
al,  1999  Mol  Cell  3:389).  A  high  proportion  (57%,  8/14)  of  the  tumors  that  retained  both 
BRCA1  alleles  showed  loss  of  the  BRCA1  protein.  In  this  subset  of  tumors  other 
mechanisms  may  be  responsible  for  BRCA1  inactivation. 


Appendix  Two:  Manuscripts 


Complex  germline  rearrangement  of  BRCA1  associated  with  breast  and 

ovarian  cancer. 

Shannon  R.  Payne1,  Beth  Newman23,  and  Mary-Claire  King4 


1  Department  of  Genetics,  University  of  Washington,  Seattle,  WA,  USA  98195 

2  Lineberger  Cancer  Center,  University  of  North  Carolina,  Chapel  Hill,  North  Carolina 
27599 

3  Current  Address:  School  of  Public  Health,  Queensland  University  of  Technology, 
Brisbane,  Queensland,  4059,  Australia 

4  Departments  of  Genetics  &  Medicine,  Division  of  Medical  Genetics,  University  of 

Washington,  Seattle,  WA,  USA  98 195 

Corresponding  Author:  Mary-Claire  King 
Telephone:  206-616-4294 
Fax:  206-616-4295 

e-mail:  mcking@u.washington.edu 


Running  Title:  Complex  rearrangement  of  BRCA1 


ABSTRACT 


Germline  mutations  in  BRCA1  predispose  to  breast  and  ovarian  cancer.  Most  germline 
BRCA1  mutations  are  small  insertions,  deletions  or  single  base  pair  (bp)  substitutions. 
These  mutation  classes  are  rarely  found  as  somatic  mutations  in  BRCA1.  Conversely, 
somatic  deletions  of  multiple  megabase  pairs  (Mb)  including  BRCA1,  as  reflected  by  loss 
of  heterozygosity,  occur  frequently  in  both  inherited  and  sporadic  breast  and  ovarian 
cancer.  In  order  to  determine  whether  deletions  or  rearrangements  of  hundreds  to 
thousands  of  bp  might  contribute  to  inherited  mutation  in  BRCA1,  we  developed  a  Long 
PCR  strategy  for  screening  the  entire  genomic  BRCA1  locus  in  high-risk  families.  We 
evaluated  genomic  DNA  from  one  high-risk  family  of  Western  European  ancestry  with 
BRCA /-linked  cancer  in  which  no  genomic  mutations  had  been  detected  using  conventional 
methods.  Long  PCR  revealed  a  complex  mutation,  g.  12977  inslO  del  1039  (based  on 
GenBank  L78833)  comprising  an  inverted  duplication  and  deletion  in  BRCA1  that  removes 
portions  of  exon  3  and  intron  3,  including  the  5'  splice  site  for  intron  3.  As  a  result  of  the 
deletion  exon  3  is  skipped,  leading  to  a  truncated  protein  and  disease  predisposition.  Unlike 
previously  reported  large  germline  deletions  in  BRCA1,  neither  breakpoint  resides  within 
an  Alu  element.  The  g.  12977  ins  10  del  1039  mutation  was  not  detected  among  eleven  other 
breast  cancer  families,  nor  among  406  breast  cancer  patients  unselected  for  family  history. 

Keywords:  BRCA1,  large  deletion,  RNA  splicing,  exon  skipping 


Germline  protein-truncating  mutations  in  BRCA1  cause  hereditary  predisposition  to 
breast  and  ovarian  cancer  (Hall  et  al.,  1990;  Ford  et  al.,  1994).  BRCA1  mutations 
detectable  by  commonly  employed  PCR-based  techniques  are  small  insertions,  deletions  or 
single  base  pair  (bp)  substitutions  which  comprise  most  germline  mutations  in  high-risk 
families  (Breast  Cancer  Information  Core.  1999),  but  rarely  occur  as  somatic  mutations 
(Futreal  et  al.,  1994;  Hosking  et  al.,  1995;  Matsushima  et  al.,  1995;  Merajver  et  al.,  1995; 
Takahashi  et  al.,  1995;  Berchuck  et  al.,  1998).  Conversely,  megabasepair  (Mb)  somatic 
deletions  including  BRCA1,  as  reflected  by  loss  of  heterozygosity  (LOH),  occur  frequently 
in  both  inherited  and  sporadic  breast  and  ovarian  cancer  (Devilee  et  al.,  1994;  Bieche  et  al., 

1995).  We  have  developed  a  protocol  to  screen  for  an  intermediate  class  of  mutations, 
involving  deletions  or  rearrangements  of  hundreds  to  thousands  of  bp.  Several  large 
deletions  and  one  large  duplication  involving  Alu  repetitive  elements  have  been  reported 
previously  as  germline  mutations  in  BRCA1  (Puget  et  al.,  1997;  Swensen  et  al.,  1997; 

Petrij-Bosch  et  al.,  1997;  Puget  et  al.,  1999a;  Puget  et  al.,  1999b). 

We  report  here  the  identification  and  characterization  of  a  large  genomic  deletion  in  a  family  with 
inherited  breast  and  ovarian  cancer.  The  mutation  results  in  skipping  of  exon  3  in  the  mature 
BRCA1  message.  Although  the  genomic  mutation  was  not  detectable  using  conventional  PCR- 
based  BRCA1  screening  procedures,  it  was  identified  using  a  Long  PCR  strategy  designed  to 
detect  intermediate  length  genomic  deletions  and  rearrangements. 

Previous  analysis  of  lymphoblast  cDNA  from  Family  5  revealed  a  transcript  lacking 
exon  3  in  several  BRCA1- linked  family  members  (Friedman  et  al.,  1994;  Fig.l).  Deletion 
of  exon  3  maintains  the  reading  frame,  but  creates  a  stop  at  codon  27.  Splice  junctions  of 
exons  2,  3,  and  5  (the  exon  joined  to  the  3'  splice  site  of  exon  3)  were  sequenced  from 
PCR  products  using  primers  immediately  flanking  the  exons,  yielding  only  wild-type 
sequence  (Friedman  et  al.,  1994).  The  genomic  basis  of  the  variant  remained  unknown. 


For  the  analysis  described  here,  gDNA  from  family  members  was  amplified  using  the 
fifteen  PCR  primer  pairs  indicated  in  Table  1.  Primers  were  designed  from  BRCA1 
genomic  sequence  (GenBank  L78833).  Primer  pairs  were  designed  to  amplify  3  to  1 1  kb 
genomic  fragments  with  at  least  1  kb  of  overlap.  PCR  products  were  evaluated  for 
differences  from  predicted  size  before  and  after  digestion  with  appropriate  restriction 
enzymes. 

Using  primer  pair  3  designed  to  amplify  a  5531  bp  product  encompassing  exon  3,  a 
variant  Long  PCR  product  co-segregated  with  the  BRCA1- linked  haplotype  of  Family  5 
(Fig.  1 , 2).  All  family  members  heterozygous  for  transcripts  lacking  exon  3  were  also 
heterozygous  for  the  variant  PCR  product. 

Wild-type  and  variant  Long  PCR  products  were  cloned  and  sequenced  from  multiple 
family  members.  Sequence  of  all  variant  clones  revealed  the  complex  mutation  shown  in 
Figure  3.  Based  on  BRCA1  genomic  sequence  L78833,  the  mutation  is  a  lObp  inverted 
duplication  of  12965-12974  from  BRCA1  exon  3  (bp  210-219  of  BRCA1  cDNA, 
F1SU14680)  and  deletion  of  1039bp.  The  notation  for  the  mutation  (Antonarakis  et  al., 
1998)  is  g.  12977  inslO  dell039  based  on  GenBank  L78833.  The  net  deletion  is  1029  bp, 
yielding  an  amplified  product  of  4502  bp  which  is  consistent  with  the  electrophoretic 
mobility  of  the  variant  PCR  product.  The  1039  bp  genomic  deletion  in  Family  5  results  in 
skipping  of  BRCA1  exon  3  in  the  mRNA  and  premature  protein  truncation  at  codon  27. 

Three  other  families  with  breast  cancer  linked  to  BRCA1  were  tested  for  the  g.  12977 
inslO  de!1039  mutation.  None  of  the  families  contained  the  g.12977  inslO  dell039 
mutation.  Families  with  multiple  cases  of  breast  and  or  ovarian  cancer  and  with  negative 
LOD  scores  for  linkage  to  BRCA1  might  nonetheless  harbor  mutations  in  one  of  these 
genes  if  some  cases  are  sporadic  (ie.  phenocopies).  Seven  such  families  were  screened  for 
g.  12977  inslO  del  1039.  All  were  wild-type  at  this  site.  Finally,  we  investigated  the 
contribution  of  the  g.  12977  ins  10  del  1039  mutation  to  breast  cancer  in  a  population-based 


series  of  242  white  and  164  African-American  breast  cancer  patients  unselected  for  family 
history.  Patients  were  ascertained  previously  as  part  of  the  Carolina  Breast  Cancer  Study 
(Newman  et  al.,  1995;  Newman  et  al.,  1998).  A  PCR  primer  pair  was  designed  to  detect  a 
514  bp  breakpoint  junction  fragment  for  the  g.  12977  inslO  dell039  mutation.  Primers  used 
to  amplify  the  breakpoint  junction  fragment  were:  (forward)  5'-TTT-TTC-TCC-CCC-CCT- 
ACC-CTG-3';  (reverse)  5'-GCT-CAG-CAT-TTG-TTA-CTC-AAG-CTG-3'.  No  mutation 
carriers  were  detected  in  406  cases. 

The  mutation  in  Family  5  differs  from  previously  reported  large  germline  deletions  in 
BRCA1.  Many  large  BRCA1  mutations  involve  Alu  sequences  (Puget  et  al.,  1997; 

Swensen  et  al.,  1997;  Petrij-Bosch  et  al.,  1997;  Puget  et  al.,  1999a;  Puget  et  al.,  1999b). 
For  two  of  these,  both  the  5'  and  3'  breakpoints  reside  in  Alu  sequence.  In  one  of  these 
families,  a  1  kb  deletion  of  exon  17  is  mediated  by  oppositely  oriented  Alu  elements, 
removing  most  parts  of  both  Alu  elements  from  the  mutant  allele  (Puget  et  al.,  1997).  In 
another  family,  a  14  kb  deletion  involving  the  BRCA1  promoter  region  creates  a  new  Alu 
element  (Swensen  et  al.,  1997).  In  contrast,  the  5'  breakpoint  for  the  Family  5  mutation 
occurs  within  BRCA1  exon  3  and  the  3'  breakpoint  occurs  in  intron  3  at  position  +1008. 
Neither  of  these  breakpoints  occurs  in  an  Alu  element.  Further,  there  is  no  more  than  4  bp 
identity  between  sequence  near  the  5'  breakpoint  and  sequence  near  the  3'  breakpoint, 
suggesting  that  this  mutation  is  the  result  of  a  nonhomologous  event. 

Combinations  of  inversions  and  deletions  have  been  seen  in  both  somatic  and  germline 
mutations  of  several  genes  including  TP53  (Greenblatt  et  al.,  1996)  and  Factor  IX 
(Ketterling  et  al.,  1994;  Sommer,  1995).  Studies  of  somatic  mutations  in  HPRT  reveal 
alterations  with  interesting  similarity  to  the  naturally  occurring  BRCA1  germline  mutation 
of  Family  5.  Somatic  deletions  in  HPRT  often  include  inserted  bases  at  the  breakpoint 
junctions  that  are  inverted  complements  of  sequences  found  at  the  breakpoints  (Rainville  et 
al.,  1995).  In  HPRT,  topoisomerase  I  and  topoisomerase  II  have  been  implicated  in 


generating  the  free  DNA  ends  that  may  result  in  these  nonhomologous  recombination 
events  (Rainville  et  al.,  1995;  Monnat  et  al.,  1992).  Topoisomerase  II  sites  also  occur  near 
rearrangement  breakpoints  in  the  Dystrophin  gene  (Hu  et  al.,  1991)  and  ring  chromosome 
21  (Wong  et  al.,  1989).  In  vitro  and  in  vivo  evidence  for  the  role  of  topoisomerase  II  in 
nonhomologous  recombination  is  well  established  in  prokaryotes  (O’Connor  et  al.,  1985; 
Ikeda,  1986)  and  a  role  for  vertebrate  topoisomerase  II  in  nonhomologous  recombination 
has  been  demonstrated  in  vitro  (Bae  et  al.,  1988).  Topoisomerase  II  boxes  are  in  close 
proximity  to  the  3'  deletion  breakpoint  of  the  complex  mutation  in  Family  5.  Sequences 
matching  13  of  15  bp  of  the  topoisomerase  II  d  box,  GTN(A/T)A(T/C)ATTNATNN(A/G), 
occur  in  BRCA1  intron  3  (Sander  and  Hsieh,  1985).  Either  the  sequence 
tTAAgTATTTATGTA  at  position  +948  or  the  sequence  aTGT AC ATTTtTCTG  at  +1225 
might  serve  as  a  recognition  site  for  cleaving  the  double  stranded  DNA  at  +1008  of  BRCA1 
intron  3. 

Mutations  in  BRCA1  and  BRCA2  account  for  at  least  80%  of  hereditary  breast  and 
ovarian  cancer  (Narod  et  al.,  1995;  Rebbeck  et  al.,  1997).  Complex  BRCA1  and  BRCA2 
mutations  such  as  that  in  Family  5  account  for  a  still  unknown  fraction  of  unexplained 
families  with  inherited  breast  and  ovarian  cancer  (Narod  et  al.,  1995;  Rebbeck  et  al.,  1997; 
Schubert  et  al.,  1997).  Additionally,  there  may  be  other' as  yet  uncharacterized  genes  that 
predispose  to  hereditary  breast  cancer  (Rebbeck  et  al.,  1997;  Schubert  et  al.,  1997). 

The  mutation  in  Family  5  adds  to  a  growing  class  of  mutations  not  detectable  by 
conventional  screening  methods.  Amplification  followed  by  genomic  SSCP  analysis  would 
invariably  amplify  the  wild-type  allele  in  Family  5.  Direct  sequencing  of  splice  junctions 
would  not  identify  the  genomic  mutation  because  the  primer  routinely  used  to  analyze  the  5' 
splice  site  for  intron  3  was  deleted  in  the  variant  allele.  Reports  of  genomic  deletions  of 
BRCA1  that  were  undetected  by  conventional  methods  further  illustrate  the  limitations  in 
standard  BRCA1  and  BRCA2  screening  procedures. 


As  more  large  genomic  deletions  and  rearrangements  are  characterized  in  the  BRCA1 
region,  several  questions  remain  to  be  addressed.  Are  Alu-Alu  recombination  events 
prominent  in  the  BRCA1  region,  as  seen  for  deletions  in  the  P-globin  gene  cluster 
(Henthom  et  al.,  1990),  or  no  more  likely  to  be  involved  than  random  sequences  in  the 
region?  Likewise,  which  is  more  representative  of  large  deletions  and  rearrangements  in  the 
BRCA1  region:  the  complex  inverted  duplication  with  deletion  observed  in  Family  5  in  this 
study  or  the  simple  deletions  reported  previously?  Of  primary  importance  is  whether  such 
mutations  occur  somatically  as  well  as  in  the  germline.  If  such  mutations  do  exist 
somatically,  it  will  be  interesting  to  determine  whether  they  are  similar  in  Alu-dependence 
and  complexity  to  those  observed  in  the  germline. 
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Legends  to  Figures 


Figure  1.  Pedigree  for  Family  5.  Three  generation  pedigree  for  Family  5  indicating 
markers  used  to  establish  linkage.  Below  the  linkage  markers  are  displayed  the  results  of 
both  SSCP  analysis  of  cDNA  and  Long  PCR  analysis  of  genomic  DNA  using  primer  pair 
no.  3  designed  to  amplify  a  genomic  region  including  BRCA1  exon  3.  Some  unaffected 
family  members  are  not  included. 

Figure  2.  Variant  allele  reflecting  BRCA1  g.  12977  ins  10  del  1039  in  Family  5  detected  by 
Long  PCR  of  genomic  DNA.  Numbers  above  lanes  are  identification  numbers  from  the 
pedigree.  The  wild-type  genomic  product  extends  from  BRCA1  intron  2  to  intron  3  and  is 
5531  bp.  Amplification  from  the  mutant  allele  yields  a  4502  bp  product  The  variant  band  is 
present  in  all  individuals  with  the  B/?G4/-linked  haplotype  and  who  are  heterozygous  for 
deletion  of  exon  3  in  BRCA1  mRNA. 

Figure  3.  Exon  skipping  resulting  in  a  truncated  BRCA1  protein  in  Family  5. 

(A)  Wild-type  BRCA1  sequence  and  splicing  of  exon  3.  nt  210-240  of  HSU  14680  shown. 

(B)  Breakpoint  sequence  of  the  1039  bp  deletion  and  effects  on  splicing  in  Family  5. 

nt  210-221  of  HSU  14680  and  10  bp  inverted  duplication,  followed  by  intron  3  sequence. 


Table  1:  Long  PCR  primers  used  for  analysis  of  BRCA1  gene. 
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a.  Long  PCR  was  carried  out  using  the  Boehringer  Mannheim  Expand  Long  Template  PCR  System 

b.  Temp.  =  Annealing  Temperature 

c.  Region  of  GenBankL78833  amplified  by  primer  pair 
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